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DATA-DRIVEN SOBOLEV TESTS OF UNIFORMITY ON 
COMPACT RIEMANNIAN MANIFOLDS 

By P. E. Jupp 

University of St. Andrews 

Data-driven versions of Sobolev tests of uniformity on compact 
Riemannian manifolds are proposed. These tests are invariant un- 
der isometries and are consistent against all alternatives. The large- 
sample asymptotic null distributions are given. 

1. Introduction. A fundamental hypothesis in directional statistics is 
that of uniformity of a distribution on a compact Riemannian manifold, 
such as the circle, sphere or rotation group. An important large class of 
tests of uniformity consists of Gine's [7] Sobolev tests. Each Sobolev test 
is specified by a sequence a\,a,2,--- of real numbers satisfying a suitable 
square-summability condition, which is given in (2.1) below. Sobolev tests 
for which only a few are nonzero are often simple to calculate, whereas 
those for which all are nonzero are consistent against all alternatives. 
The vastness of the class of Sobolev tests presents a problem, as the statisti- 
cian needs to choose the sequence ai,a2, ■■ ■ ■ This paper solves the problem 
by providing a simple data-driven version of Sobolev tests of uniformity on 
arbitrary compact connected Riemannian manifolds, in which the sequence 
a\,a2, ■■■ is not chosen in advance, but is specified automatically by the 
data. These data-driven tests are invariant under isometries and consistent 
against all alternatives. 

The construction of the data-driven tests is based on the fact that Sobolev 
tests of uniformity can be regarded as weighted score tests of uniformity 
within some of the canonical exponential models introduced by Beran [1]. 
Thus, they are analogous to Neyman's [21] smooth tests of uniformity on 
the unit interval. Data-driven versions of Neyman's smooth tests were intro- 
duced by Ledwina [18] and refined in Inglot, Kallenberg and Ledwina [12] 
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and Kallenberg and Ledwina [16, 17]. Consistency and other desirable prop- 
erties of these tests were established by Kallenberg and Ledwina [15], Inglot, 
Kallenberg and Ledwina [12], Inglot [11] and Inglot and Ledwina [13, 14]. 
By suitable adaptation of the version of Ledwina's test considered in [12, 16] 
and [17], Bogdan, Bogdan and Futschik [2] obtained a data-driven test of 
uniformity on the circle. The data-driven tests presented in this paper gener- 
alize those of Bogdan et al. from the circle to arbitrary compact Riemannian 
manifolds. 

In Section 2, the construction of Sobolev tests of uniformity is reviewed 
and the data-driven versions are introduced. Results on consistency and 
other large-sample asymptotic properties are given in Section 3. Section 4 
gives details in the important cases in which the sample space is a sphere, 
a projective space or the rotation group 50(3). Some simulation results are 
reported in Section 5. 

An outline of the requisite differential geometry can be found in Section 
2 of [7] or the Appendix of [9]. More detailed accounts are given in [8] and 
[3]. 

2. Data-driven Sobolev tests of uniformity. 

2.1. Sobolev tests of uniformity. Let M be a compact connected Rie- 
mannian manifold without boundary. The Riemannian metric determines 
the uniform probability measure \i on M. The intuitive idea behind Gine's 
[7] Sobolev tests of uniformity is to map the manifold M into the Hilbert 
space L 2 (M, fi) of square-integrable real- valued functions on M by a function 
t : M -> L 2 (M, u) such that if x is uniformly distributed then i£[t(x)] = 0, 
and to reject uniformity if the sample mean of t(x) is "far" from 0. 

The standard way of constructing such mappings t is due to Gine [7] 
and is based on the eigenfunctions of the Laplacian operator on M acting 
on smooth real- valued functions on M. For k > 1, let denote the space 
of eigenfunctions corresponding to A^, the fcth nonzero eigenvalue, and put 
dj, = dim Ek. Then there is a well-defined map, t& of M into E^ given by 

<4 

t k(x) = ^2fi(x)fi, 
i=i 

where {/j : 1 < i < d^} is any orthonormal basis of E^. If 01,02, •• • is a se- 
quence of real numbers such that 

00 

(2.1) a l d k < 00 

k=l 

then 

00 

(2.2) x^t(x) = Y,a k tk(x) 

k=i 
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defines a mapping t of M into L 2 (M,fi). The resulting Sobolev statistic 
evaluated on observations x±, . . . , x n on M is 

2 



T n = - 



i=l 



1 



i=ij=i 



where (,) denotes the inner product on L 2 (M, fi) given by 



</, g) 



M 



f(x)g(x) dfi(x) 



The corresponding Sobolev test rejects uniformity for large values of T n . 

2.2. Score tests of uniformity. Any vector (6\, . . . ,6^) in ©f = i-Ej de- 
termines a distribution on M with density (with respect to the uniform 
distribution) 

(2.3) /(x; 1 ,...,6 k )= exp jxj<0„ t,(x)) - . . . ,0 fc ) j, 

where re(#i, . . . , #&) is a log normalizing constant. The class of such distri- 
butions was introduced by Beran [1]. The quadratic score statistic Sk in 
the score test of uniformity (i.e., 9j = for j = 1, . . . , k) in the exponential 
model (2.3) is 

2 



(2.4) 



n 



1 



n 



i=i 



where is t defined by (2.2) with 
(2.5) aj -- 



for j < k, 
for j > k. 



Thus Sk is the Sobolev statistic corresponding to (2.5). These t(^) are used 
in Hendriks's [9] density estimates which are given by 

1 

n 



fk(x) = -£(t( fe )(xi),t( fe )(a;)). 



i=i 



2.3. Data-driven tests of uniformity. A major problem in practice with 
the score tests of Section 2.2 is the need to choose a suitable k. The solution 
proposed here is for this choice to be made by the data, using a modification 
of Schwarz's [24] Bayesian Information Criterion (BIC) selection rule, as 
considered by Inglot et al. [12], Kallenberg and Ledwina [16, 17], Inglot [11] 
and Inglot and Ledwina [14] for observations on the line and by Bogdan et 
al. [2] for observations on the circle. 
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The choice of k in the data-driven versions of the score tests of Section 2.2 
is based on the penalized score statistic 

(2.6) B s (k) = S k -u k logn, 
where 

k 
i=l 

The second term on the right-hand side of (2.6) penalizes higher-dimensional 
models. The value of k is chosen as k, where 

(2.7) k = milk G N:Bs(k) = sup Bs(m)\. 

I mGN J 

(Recall that inf0 = oo.) This procedure is an analogue for score tests of 
Schwarz's [24] BIC selection rule. The data-driven score tests reject the null 
hypothesis of uniformity for large values of S k - 

The main large-sample properties of these tests can be stated intuitively 

as: 

(i) Under the null hypothesis, k tends to be near 1, so that the test 
statistic S k is "often" simple (for large samples); 

(ii) The asymptotic null distribution of S k is chi-squared; 

(iii) Under the alternative hypothesis, k tends to rise to a value large 
enough to cause rejection of the null hypothesis. 

Rigorous versions of (i), (ii) and (iii) are given in Theorems 3.1, 3.2 and 3.3, 
respectively. 

A useful geometrical tool in deriving the properties of data-driven score 
tests is the spectral function, which can be defined as 

(2.8) e(x,y,T) = — \— £ {t k {x)Mv)), x,y£M,T>0, 

vol ( M ) xt^T 

where vol(M) denotes the Riemannian volume of M. Note that 

1 2 

( 2 -9) e(a,a,A fc )= ||t (fc) (x)|| . 

A key property of the spectral function is 

sup \T- m / 2 e(x,x,T)(2^) m T(m/2 + l) - 1| = 0{T~ 1 ' 2 ), 

xeM 

(2.10) 

T — > oo, 

where m is the dimension of M and T denotes the gamma function. See, 
for example Theorem 1.1 of [10] or equation (2.25) of [6]. Let N(T) be the 
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number of eigenvalues A (counted with their multiplicities) of the Laplacian 
with A < T. Then 



e(x,x,T) dvol(x) = vol(M)iV(T), 



vol(M) 



(2.12) v k ~ ^ ^wTTZ: /n i in A fc as /c ^ oo. 



M 

and so (2.10) yields Weyl's formula 

(2.11) lim T~ m/2 N(T) - 

v ; r^oo v ' (2 1 /^r) m r(m/2 + l) 

See, for example, page 243 of [20] or page 9 of [4]. Thus 

vol(M) xm/2 
(20F) m r(m/2 + l)' 

Combining (2.10) with (2.11) gives 

(o i q\ vol(M)e(x,x,\ k ) . 

(2.13) ► 1 (uniformly m x) as k — > oo. 

The following complement of (2.13) will be useful. 

Proposition 2.1. Ifx^ythen 
e(x,y,A fc ) 

(2.14) >0 as k -> cx). 

Since the proof of Proposition 2.1 is rather technical, it is postponed to 
the Appendix. 

If k = oo, then S k is not defined. The following proposition shows that 
this occurs with probability zero, except in very small samples. 

Proposition 2.2. If n > 3 then for random samples of size n from a 
continuous distribution on M , 

(2.15) P(k = oo)=0. 

Proof. S ince the distribution is continuous, the observations x\ 7 . . . , x n 
are distinct with probability 1. In this case, (2.4), (2.8), (2.9), (2.13), and 
(2.14) give 



1 ( 71 n 1 

— \ J2\\ t (k)( X i)\\ 2 + J2J2( t (k)(Xi),t {k) ( Xj )) \ 

Vk U=i i=i j& > 

1(M) fyv e(xi,Xi,X k ) | e(xj,Xj, X k ) \ 

I i=i ^ i=i jVi ^ fc J 

as — > oo. 



vo 

n 
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It follows that 

B s (k) - BsQ) 1 . , 

> 1 — log n as k —> oo. 

v k 

Thus, if n > 3 then Bs(k) < -Bs(l) for large enough k, so that k < oo. □ 
Remark. In practice k is calculated not by (2.7) but as 
inf < k : B$(k) = sup Bs{m) > 

I l<m<K ) 

for some suitable K. Tables 1, 3 and 5 in Section 5 indicate that it is rea- 
sonable to take K = 5 for M = S 2 ,RP 2 or SO(3). Calculation of S~ k takes 
approximately K times as much effort as calculation of a Sobolev test which 
is consistent against all alternatives. Thus, the extra computational cost of 
using a data-driven test is small. 

3. Asymptotic properties. 

Theorem 3.1. Under uniformity, 

lim P(fc = l) = l. 

n— »oo 

Proof. Since {/$ : 1 < i < d k } is a orthonormal basis of E k , if x is uni- 
formly distributed then 

(3.1) E[t {k) (x)}=0 and £[t (fc) (a;)t (fc) (aO / ] = I„ h . 
Straight forward calculation gives 

(3.2) E[S k ] = v k 
and 

I ( n n \ 

\i=i i=i j^i ) 



1 

— ^var(||t (fc) (xi)|| 2 ) 
n i=i 

2 n 



+ ^ZlZ! tr ( £; [ t (fc)( Xi ) t w( :Ej ) / ]- E [ t (fc)( x i) t w( :z; i)']) 
(3.3) = ivar(||t (fc) (x)f) + 2^1 - ^\v k . 
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It follows from (2.9)-(2.12) that there is a positive A such that 

\\\Hk){x)\\ 2 -Vk\<Av k {m - l)/m for A; = 1,2,... and x E M. 

Thus 

var(||t (fe) (*)|| 2 ) < A 2 ^ 1 - 1 '^ for k = 1, 2, ... . 
Combining this with (3.3) gives 

(3.4) var(S,0<ylV (1 - 1/m) +2^. 

It follows from (2.11) that there is a positive Tq such that 

(3.5) T > T 2 < iV(3 2/m T) /N(T) < 4. 

Put //i = Ai, choose \x 2 such that N(fj, 2 ) = ^2, and define //3,/i4, ... by 

(3.6) /x, = 3 2(/ - 1)/m ^ 2 for / > 3. 

From (3.5) and (3.6) there are positive constants C\ and C 2 such that 

(3.7) N(m) > 2 l Ci for / > 1, 

(3.8) N(jii)/N(jh^) < C 2 for I > 2. 

Suppose that ^1 < Ar, < /ij+i, for some / > 2. Define r by iV(^ +1 ) = v T . 
Since fc 1— > Sk and iV are increasing functions, 

S r - TV (/ij) logn >Sf,- z^logn > Si - v\ logn, 

so that, by (3.2), 

S r -E[S r ] >Si-uilogn + N(m)\ogn-N(m +1 ) 

> (N(ji l )-v 1 )logn-N(iii +1 ) 

(3.9) 

= iV( W ){[l - ui/NQh)] logn - N( W+ i)/N( W )} 

> N{^){[d 2 /{di + da)] logn - C 2 }, 

which is positive if logn > C 2 {di + d 2 )/d 2 . In this case, it follows from (3.9), 
Chebyshev's inequality and (3.4) that 

P{m < Xj, < Mi+i) 

var(5 r ) 



< 



{N{m)[d 2 /(di+d 2 )\ogn-C 2 ]} 2 

A 2 ( Njni +1 ) \ 2 2 Nij*i+i) \ f d 2 
iV(u m ) 2/m V N(pi) J N(jii) N(m) )\di+d 2 
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Provided that logn > 1Ci{d\ + d,2)/d,2, using (3.8) then gives 

(3.10) P( W <A t < w+1 )<( A '% + -^\ 4*L±j£ . 

* IJV(mi+i) 2/ ™ »(«)/ 4(logn) ! 

Summing (3.10) over I = 2, 3, . . . and using (3.7) and (2.15) gives 

P(A;> 1) <C 3 (logn)- 2 

for some positive C3, and so 

P(fc>l)->0 asn^oo. □ 

The asymptotic null distribution of <SV can now be found. 

Theorem 3.2. Under uniformity, 

c d 2 

where denotes convergence in distribution and % 2 denotes the chi-squared 
distribution with v\ degrees of freedom. 

Proof. This is a consequence of Theorem 3.1 and the fact [which can 
be obtained by applying the central limit theorem to t^(x) and using (3.1)] 
that, under uniformity, for any fixed k, 

Sk~^xl k asrwoo. □ 

The following theorem guarantees consistency. 

Theorem 3.3. The test which rejects uniformity for large values of St 
is consistent against all alternatives to uniformity. 

PROOF. For any nonuniform distribution, there is a natural number K 
such that E[t K (x)} / 0. For 1 <j < K, 

P{k = i) < P(Sj - z/jlogn >S K ~ ^logn) 

< P(n||t^|| 2 < (vk — Vj) logn) 

— > as in 00, 

because ||t^|| 2 — > ^\E\tK(x)\ || 2 almost surely as n — > 00 and (logn)/n— >0 as 
n — > 00. Then, for any positive C, 

P(S~ k >C)> P(S K > C) - P(k < K) 

> P{n\\t K \\ 2 > C) - P{k < K) 

— > 1 as n — > 00. 

Since has a nondegenerate limiting distribution under uniformity, it fol- 
lows that the test is consistent. □ 
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4. Examples: spheres, projective spaces and the rotation group. In order 
to calculate the statistic St, explicit expressions for {tk(x),tk(y)) and i/j. 
are required. This section gives such expressions where M is a sphere, a 
projective space or the rotation group 50(3). 

4.1. Spheres. It follows from the formula for the cosine of a difference 
and from Proposition 2.1 of [22] that, for x,y in S^" 1 (the unit sphere in 



2cos{k6), tfjP = 2, 

(4.1) <t fc (x),t fc (y)) = <j ( 1+ k\ C a {x , yl ifp>2) 



where cos 6* = x'y, ol = p/2 — 1 and C% denotes the Gegenbauer polynomial 
of degree k. The expression for dk given on page 171 of [22] yields 

In the case of S 2 , (4.1) reduces to 

(t fe (x),t fe (y)) = (2fc + l)P fe (x'y) 

(as in (6.7) of [7]), where P& denotes the Legendre polynomial of degree k, 
and (4.2) reduces to 

v k = fc(fc + 2). 

4.2. Projective spaces. Since the eigenspace E)~ of the Laplacian on RP P_1 
(the projective space of one-dimensional subspaces of W), can be identified 
with the eigenspace .E^fe of the Laplacian on S p ~ l , it follows from (4.1) that, 
for ±x,±y in MPP' 1 , 

2cos(2fc0), ifp = 2 

(t k (±*)M±y)) = < f 1 + ^W (xV) , ifp>2) 



a 

where cos# = x'y, and from (4.2) that 



p + 2i-3\ , (p + 2i-2 

i=l 



p-2 J V P-2 
In the case of MP 2 , (4.3) and (4.4) reduce to 

<tfc(±x),t fc (±y)) = (4fc + l)P 2 fe(x'y) 

and 

v k = k(Sk + 2). 
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4.3. The rotation group 50(3). There is a standard identification of the 
rotation group 50 (3) with MP 3 given by the mapping which sends ±u = 
±(iii, • • • , U4)' in MP 3 to the matrix 

u l + u \ ~ u 3 ~ u i ~ 2(uiU4 — U2U3) 2(uiU3 + M2^4) 
2 (ill + U2U3) U 2 + U 2 — U 2 — U 2 — 2(«iU2 — U3II4) 
— 2(nili3 — U2II4) 2(^1^2 + 113114) Hi + u\ — u\ — U3 

in 50(3). Combining this with (4.3) and (4.4), for p = 4 shows that on 50(3) 

(t fc (X), t fc (Y)> = (2k + 1)C 2 \. ( tr(X, J ) + 1 ) , X,Ye 50(3), 

k(4k 2 + 12k + 11) 
- fc = § • 

5. Simulation study. Tables 1-6 summarize the null distributions of k 
and for 5 2 , MP 2 and 50(3), based on 10,000 simulations. Comparison 
with the simulations for the circle 5 1 by Bogdan et al. [2] show that, as 
n — > 00, k converges more rapidly to 1 and the distribution of 5^ converges 
more rapidly to its limiting x 2 distribution for 5 2 , MP 2 and 50(3) than for 
5 1 . 

Tables 2, 4 and 6 show that comparison of the observed values of 5^ 
with the upper 10% and 5% quantiles of the large-sample asymptotic \ 2 
distribution is reasonable for n > 30 for 5 2 , n > 20 for MP 2 and n > 25 for 
50(3). Since the asymptotic distribution of St is x 2 > it follows from general 
results of Cordeiro and Ferrari [5] that there are polynomial modifications 
of St which bring its null distribution closer to the asymptotic distribution. 
The cubic modification 5| of S^ given by 

(5.1) 5? = (l+ L37 ~ °- 31 ^ }5 £ for M = 5 2 , 



n 



1.91 - 0.215 £ 1 9 
(5.2) 5| = <jl + tl S ~ k forM = MP 2 , 

Table 1 

Empirical distribution of k (based on 10,000 simulations) from samples of size n from 

the uniform distribution on S 2 





n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


1 


8407 


9635 


9826 


9900 


9951 


9964 


2 


1060 


338 


167 


97 


49 


36 


3 


357 


25 


6 


3 








4 


92 


2 


1 











5-10 


84 
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Table 2 

Empirical upper tail probabilities P(Sj, > xl;a) (upper line) and P{S? > xl;a) (lower 
line) from samples of size n from the uniform distribution on S 2 . The estimates are 
based on 10,000 simulations. The modified statistic S 1 ^ is defined in (5.1). Bold figures 

indicate values in the interval a ± 2^J a(\ — a)/ 10,000 



Q 


n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


0.10 


0.232 


0.132 


0.114 


0.108 


0.110 


0.102 




0.078 


0.114 


0.105 


0.101 


0.102 


0.097 


0.05 


0.187 


0.078 


0.062 


0.058 


0.058 


0.051 




0.000 


0.056 


0.050 


0.048 


0.048 


0.043 


0.01 


0.120 


0.043 


0.023 


0.017 


0.014 


0.011 




0.000 


0.000 


0.018 


0.015 


0.010 


0.008 



Table 3 

Empirical distribution of k (based on 10,000 simulations) from samples of size n from 
the uniform distribution on RP 2 





n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


1 


9044 


9900 


9968 


9986 


9996 


9999 


2 


759 


95 


32 


14 


4 


1 


3 


139 


4 














4 


41 


1 














5-8 


17 


















Table 4 

Empirical upper tail probabilities P(Sj. > xi;a) (upper line) and P(S| > xi;a) (lower 
line) from samples of size n from the uniform distribution on RP 2 . The estimates are 
based on 10.000 simulations. The modified statistic 5| is defined in (5.2). Bold figures 

indicate values in the interval a ± 2-^/a(l — a)/10,000 



a 


n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


0.10 


0.162 


0.099 


0.100 


0.098 


0.098 


0.100 




0.116 


0.099 


0.094 


0.099 


0.095 


0.101 


0.05 


0.123 


0.051 


0.052 


0.049 


0.048 


0.048 




0.028 


0.042 


0.042 


0.045 


0.040 


0.046 


0.01 


0.100 


0.018 


0.013 


0.009 


0.010 


0.009 




0.000 


0.009 


0.006 


0.006 


0.005 


0.008 



r 5.496 -0.6365f+ 0.0185? i 
(5.3) S| = I I + — ^ \S k for M = 50(3) 
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Table 5 

Empirical distribution of k (based on 10,000 simulations) from samples of size n from 
the uniform distribution on 50(3) 





n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


1 


9790 


9997 


10000 


10000 


10000 


10000 


2 


189 


3 














3-4 


21 


















Table 6 

Empirical upper tail probabilities P{S%, > X%a) (upper line) and P(S^ > xl;a) (lower 
line) from samples of size n from the uniform distribution on £0(3). The estimates are 
based on 10,000 simulations. The modified statistic is defined in (5.3). Bold figures 

indicate values in the interval a ± 2-^/a(l — a)/10,000 



a 


n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


0.10 


0.094 


0.087 


0.093 


0.093 


0.097 


0.095 




0.095 


0.089 


0.098 


0.096 


0.097 


0.095 


0.05 


0.055 


0.042 


0.045 


0.044 


0.047 


0.047 




0.043 


0.037 


0.045 


0.046 


0.045 


0.045 


0.01 


0.027 


0.008 


0.010 


0.008 


0.009 


0.009 




0.024 


0.008 


0.009 


0.012 


0.009 


0.009 



(obtained by cubic regression of the empirical quantiles of the limiting dis- 
tribution on the empirical quantiles of S^) is reasonable at the 10% and 5% 
levels for n > 15 for S 2 , RP 2 and 50(3). 

Table 7 

Empirical power P(S > X3;a) (based on 10,000 simulations) of tests of uniformity on S 2 
against alternative (5.4) of mixture of Fisher distributions. F„ denotes Ginffs statistic; 

St is defined in (5.1) 



a 


S 


n = 5 


n = 10 


n = 15 


n = 20 


n = 25 


n = 30 


0.10 


F n 


0.101 


0.117 


0.143 


0.151 


0.177 


0.197 




s% 


0.301 


0.201 


0.196 


0.190 


0.205 


0.215 




si 


0.103 


0.176 


0.184 


0.183 


0.200 


0.210 


0.05 




0.050 


0.060 


0.070 


0.078 


0.096 


0.106 




St 


0.261 


0.156 


0.145 


0.147 


0.162 


0.171 




s l 


0.000 


0.127 


0.137 


0.138 


0.157 


0.165 


0.01 


F n 


0.000 


0.000 


0.000 


0.000 


0.000 


0.000 






0.182 


0.124 


0.114 


0.111 


0.128 


0.137 




st 


0.000 


0.000 


0.104 


0.107 


0.124 


0.133 
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A common alternative to uniformity on S 2 is a mixture of two Fisher 
distributions with modes that are antipodal, so that the probability density 
function is 

(5.4) /(x; /x, k) = - ~ (exp{K/i'x} + exp{-K//x}) 

with fi G S 2 . The main Sobolev test in common use on S 2 that is consistent 
against all alternatives is Gine's [7] F n test. (See also page 209 of [19].) 
Table 7 gives the empirical power (based on 10,000 simulations) of some 
tests of uniformity against the alternative of density (5.4) with k = 2. The 
tests reject uniformity for P(S > xi-a)) where S is F n , the statistic St of the 
data-driven test, and the modification S"| defined in (5.1). The data-driven 
test is definitely more powerful than Gine's test for the small sample sizes 
considered. 

APPENDIX: PROOF OF PROPOSITION 2.1 

This proof is based on that of the asymptotic behavior of the heat kernel 
given in Chapter 5 of [23]. 

Let D denote the de Rham operator d + d* acting on the complex exterior 
forms on M. Then D 2 = A is the Laplacian operator on the complex exterior 
forms on M. For T = 1,2,..., choose a smooth function ipx : R — > R such that 

1, if |s|<(2A T + Ar+i)/3, 
0, if |s| > (A r + 2A T+ i)/3. 

The kernel of </?t(A) is the spectral function e(-,-,T) given by (2.8). Given 
distinct points x and y in M, define 5 as the Riemannian distance between 
x and y. Choose a smooth function ip : R — ► R such that 

\t\<5/3 4>(t) = l, 

(A.l) 

|t| > 25/3 => V(*) = 0- 

Since and ip are in the class 5(R) of rapidly decreasing functions with 
all derivatives rapidly decreasing, their Fourier transforms 0t and ip are 
in <S(R), and so there are functions f\ x and f%T m <5(R) with Fourier 
transforms /i^ and /2,T satisfying 

A,t(A) = — 0t(A)V(A), 

(A.2) 

/ 2 ,t(A) = — ^ t (A)(1-V(A)). 



^t(s) 



Then 



/ 1T (A) + / 2iT (A) = — ^r(A). 
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Since the functions /i t and fi,T are i n the space 1Z(R) of rapidly decreas- 
ing functions, the operators /i ; r(A) and /2,t(A) have smooth kernels (see 
Proposition 5.8 of [23]). Denote these kernels by e\ T an d e 2,T- As 

1 c°° 

f 1>T (A) = — ipT(\ 2 )e iXD tfj(\) dX, 

it follows from (A.l) and the fact that D has unit propagation speed (cf. 
Proposition 5.5 of [23]) that e± T is supported within 25/3 of the diagonal. 
Thus 

(A.3) e(x ' y ' T) =e 2 , T (x,y). 

It follows from (A.2) that 



i m 2sin(c T A) 

/2,t(A) (1 - V(A)) 



2(Ay + i — Ay) 
3i*r 



where cr = (2At + At+i)/3, and so /2,t tends to in S(R) as T — ► oo. By 
Fourier theory, /2 7 1 — > in <S(R) and so in 7£(R). Thus (see Proposition 5.8 
of [23]) e 2 ,T -> as T -> 00. By (A.3) this gives (2.14). 

Acknowledgment. I am grateful to J0rgen Ellegaard Andersen for sug- 
gesting that Proposition 2.1 could be proved by adapting a proof of the 
asymptotic behavior of the heat kernel. 
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